Sequential Sampling to Myopically Maximize the Expected Value of Information
Authors
Abstract
Statistical selection procedures are used to select the best of a finite set of alternatives, where “best” is defined in terms of each alternative’s unknown expected value, and the expected values are inferred through statistical sampling. One effective approach, which is based on a Bayesian probability model for the unknown mean performance of each alternative, allocates samples by maximizing an approximation to the expected value of information (EVI) from those samples. The approximations include asymptotic and probabilistic approximations. This paper derives sampling allocations that avoid most of those approximations to the EVI but entail sequential myopic sampling from a single alternative per stage of sampling. We demonstrate empirically that the benefits of reducing the number of approximations in the previous algorithms are typically outweighed by the deleterious effects of a sequential one-step myopic allocation when more than a few dozen samples are allocated. Theory clarifies the derivation of selection procedures that are based on the EVI.
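As a rough illustration of what a one-step myopic allocation looks like, the sketch below scores each alternative by the expected gain from one more sample and then samples the highest scorer. It is not the paper's derivation: it assumes independent normal beliefs with known sampling variance (the setting of the knowledge-gradient policy), and the function names `one_step_evi` and `next_alternative` are hypothetical.

```python
import math


def std_normal_pdf(z):
    """Density of the standard normal distribution."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def std_normal_cdf(z):
    """CDF of the standard normal, via erfc for accuracy in the lower tail."""
    return 0.5 * math.erfc(-z / math.sqrt(2.0))


def one_step_evi(means, variances, noise_variances):
    """Expected gain from one more sample of each alternative.

    Assumptions (not from the paper): independent normal beliefs with the
    given posterior means and variances, known sampling-noise variances,
    at least two alternatives, and strictly positive posterior variances.
    """
    scores = []
    for i, (mu, var, noise) in enumerate(zip(means, variances, noise_variances)):
        # Best competing posterior mean.
        best_other = max(m for j, m in enumerate(means) if j != i)
        # Standard deviation of the change in the posterior mean of
        # alternative i caused by a single additional sample.
        s = var / math.sqrt(var + noise)
        # Standardized (non-positive) distance to the best competitor.
        z = -abs(mu - best_other) / s
        scores.append(s * (z * std_normal_cdf(z) + std_normal_pdf(z)))
    return scores


def next_alternative(means, variances, noise_variances):
    """Index of the alternative with the largest one-step expected gain."""
    scores = one_step_evi(means, variances, noise_variances)
    return max(range(len(scores)), key=scores.__getitem__)
```

Calling `next_alternative` once per stage, sampling the returned alternative, and updating its posterior mean and variance gives a sequential procedure that samples a single alternative per stage. With equal means, the alternative with the larger posterior variance is chosen, since one more sample can move its mean further.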
Similar resources
Hierarchical Knowledge Gradient for Sequential Sampling
We propose a sequential sampling policy for noisy discrete global optimization and ranking and selection, in which we aim to efficiently explore a finite set of alternatives before selecting an alternative as best when exploration stops. Each alternative may be characterized by a multi-dimensional vector of categorical and numerical attributes and has independent normal rewards. We use a Bayesi...
A Knowledge-Gradient Policy for Sequential Information Collection
In a sequential Bayesian ranking and selection problem with independent normal populations and common known variance, we study a previously introduced measurement policy which we refer to as the knowledge-gradient policy. This policy myopically maximizes the expected increment in the value of information in each time period, where the value is measured according to the terminal utility function...
Adaptive Policies for Sequential Sampling under Incomplete Information and a Cost Constraint
We consider the problem of sequential sampling from a finite number of independent statistical populations to maximize the expected infinite horizon average outcome per period, under a constraint that the expected average sampling cost does not exceed an upper bound. The outcome distributions are not known. We construct a class of consistent adaptive policies, under which the average outcome co...
Statistical Matching in ‘Labour Force’ and ‘Time Use’ Surveys
Abstract. In the National Statistical Systems, some parts of the official statistics required for development planning and evidence base management are collected through sampling surveys. Due to the high cost of the sampling process, as well as nonresponses, it may not be possible to collect all of the expected variables in a survey. In order to obtain a comprehensive and complete source o...
Woodroofe’s One-Armed Bandit Problem Revisited
We consider the one-armed bandit problem of Woodroofe [J. Amer. Statist. Assoc. 74 (1979) 799–806], which involves sequential sampling from two populations: one whose characteristics are known, and one which depends on an unknown parameter and incorporates a covariate. The goal is to maximize cumulative expected reward. We study this problem in a minimax setting, and develop rate-optimal police...
Journal title:
- INFORMS Journal on Computing
Volume 22, Issue
Pages -
Publication date: 2010